AIMET Integration #366
kozlov721
left a comment
Could you also please include in the PR comment a few notes/justifications on the changes that are not necessarily related to the PR?
Nearly all of the changes are related in some way to making AIMET work. I added comments on some of the individual changes in the code. If you have questions about any other specific changes, let me know.
Walkthrough
Adds AIMET quantization support (config, utilities, LuxonisModel.quantize, CLI, callback) and refactors loader/forward interfaces to accept dict or tensor inputs; updates Lightning export/pickling, loss/visualizer/metric modules, packaging, CI, docs, and tests.
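For readers unfamiliar with the pickling hooks mentioned in the walkthrough, here is a minimal sketch of how `__getstate__` and `__setstate__` let an object survive copying or serialization while holding an attribute that cannot be serialized. It is not the code from this PR: the class name `CheckpointableModule` and the `_lock` attribute are hypothetical, and the PR does not say which state LuxonisLightningModule actually excludes.

```python
import copy
import threading


class CheckpointableModule:
    """Hypothetical stand-in for a module holding an unpicklable attribute."""

    def __init__(self, name: str) -> None:
        self.name = name
        # threading.Lock objects can be neither pickled nor deep-copied.
        self._lock = threading.Lock()

    def __getstate__(self) -> dict:
        # Copy the instance dict and drop the entry that cannot be serialized.
        state = self.__dict__.copy()
        state.pop("_lock", None)
        return state

    def __setstate__(self, state: dict) -> None:
        # Restore the saved attributes and recreate the dropped one.
        self.__dict__.update(state)
        self._lock = threading.Lock()


original = CheckpointableModule("demo")
# copy.deepcopy goes through the same __getstate__/__setstate__ hooks as pickle.
restored = copy.deepcopy(original)
```

Without the two hooks, `copy.deepcopy(original)` would raise a `TypeError` because of the lock; `pickle.dumps` relies on the same protocol.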
Purpose
Adds an option to quantize a trained model using various post-training quantization (PTQ) and quantization-aware training (QAT) techniques.
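As background on what PTQ does numerically, the following is a rough sketch of 8-bit affine quantization with min/max calibration in plain Python. It does not use AIMET or any API from this PR; the function names `calibrate`, `quantize`, and `dequantize` and the sample weights are illustrative assumptions.

```python
def calibrate(values: list[float], num_bits: int = 8) -> tuple[float, int]:
    """Derive an affine scale and zero point from the observed value range."""
    qmax = 2**num_bits - 1
    # Include 0.0 in the range so that zero is representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / qmax or 1.0  # fall back to 1.0 for an all-zero input
    zero_point = round(-lo / scale)
    return scale, zero_point


def quantize(values: list[float], scale: float, zero_point: int,
             num_bits: int = 8) -> list[int]:
    """Map floats to unsigned integers, clamping to the representable range."""
    qmax = 2**num_bits - 1
    return [min(max(round(v / scale) + zero_point, 0), qmax) for v in values]


def dequantize(q: list[int], scale: float, zero_point: int) -> list[float]:
    """Map quantized integers back to approximate float values."""
    return [(x - zero_point) * scale for x in q]


weights = [-1.0, -0.25, 0.0, 0.5, 1.0]  # made-up sample values
scale, zero_point = calibrate(weights)
q_weights = quantize(weights, scale, zero_point)
approx = dequantize(q_weights, scale, zero_point)
```

The round trip recovers each value to within roughly one quantization step (`scale`). QAT differs in that this quantize/dequantize step is simulated during training so the weights can adapt to the rounding error.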
Specification
- LuxonisModel.quantize method
- AIMETCallback
- exporter.aimet to the Config
- forward method to LuxonisLightningModule
- full_forward
- print_table as a required abstract method to BaseLuxonisProgressBar
- __getstate__ and __setstate__ in LuxonisLightningModule
- config.yaml
Dependencies & Potential Impact
None / not applicable
Deployment Plan
None / not applicable
Testing & Validation
- AIMETCallback in test_callbacks
- LuxonisModel.quantize for all predefined models with the full set of PTQ techniques enabled
Summary by CodeRabbit
New Features
Bug Fixes & Improvements
Documentation